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Abstract 

A new approach to stochastic integration is described, which is based 
\^») | on an a.s. pathwise approximation of the integrator by simple, symmet- 

ric random walks. Hopefully, this method is didactically more advanta- 
geous, more transparent, and technically less demanding than other exist- 
, ing ones. In a large part of the theory one has a.s. uniform convergence 

■ on compacts. In particular, it gives a.s. convergence for the stochastic 

integral of a finite variation function of the integrator, which is not cadlag 
in general. 
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1 Introduction 

The main purpose of the present paper is to describe a new approach to stochas- 
tic integration, which is based on an a.s. pathwise approximation of the integra- 
tor by simple, symmetric random walks. It is hoped that this method is peda- 
gogically more advantageous, more transparent, and technically less demanding 
than other existing ones. This way hopefully a larger, mathematically less ma- 
ture audience may get acquainted with a theory of stochastic integration. Since 
stochastic calculus plays an ever- increasing role in several applications (math- 
ematical finance, statistical mechanics, engineering, . . . ) nowadays, this aim 
seems to be justified. 

A basic feature of the present theory that, except for the integral of general 
predictable integrands, one always has almost sure uniform convergence on com- 
pact intervals. This is true for the approximation of the integrators, quadratic 
variations, local times, and stochastic integrals when the integrand is a C 1 or a 
finite variation function of the integrator. We believe that for a beginner it is 
easier to understand almost sure pathwise convergence than L 2 or in probability 
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convergence that typically appear in stochastic calculus textbooks. We mention 
that our method significantly differs from the a.s. converging approximation 
given by Karandikar Important tools in our approach are discrete versions 
of Ito's and Ito-Tanaka formulas. The continuous versions easily follow from 
these by a.s. pathwise convergence. (See earlier versions of it in [9] and [TO].) 
To our best knowledge, our approach is new in giving a.s. convergence for the 
stochastic integral of a finite variation function of the integrator, which is not 
cadlag in general. 

In the case of a more general, e.g. predictable integrand, our method is 
closer to the traditional ones, e.g. we too have I? convergence then. The 
only, hopefully interesting, difference is that in the approximating sums random 
partitions are used so that the values of the integrand are multiplied by ±1 
times a constant scaling factor. The signs are from an independent, symmetric 
coin tossing sequence. 

The most general integrators considered in this paper are continuous local 
martingales M. It is easy to extend this to semimartingalcs X that can be 
written X = M + A, where M is a continuous local martingale and A is a finite 
variation process. The reason is simple: integration with respect to such an A 
can also be defined pathwise. 

2 Preliminaries 

A basic tool of the present paper is an elementary construction of Brownian 
motion (BM). The specific construction we are going to use in the sequel, taken 
from [10] . is based on a nested sequence of simple, symmetric random walks 
(RW's) that uniformly converges to the Wiener process (=BM) on bounded 
intervals with probability 1. This will be called "twist and shrink" construction 
in the sequel. Our method is a modification of the one given by Frank Knight 
in 1962 0. 

We summarize the major steps of the "twist and shrink" construction here. 
We start with an infinite matrix of independent and identically distributed ran- 
dom variables X m (k), P {X m (k) = ±1} = i (m > 0, k > 1), defined on the same 
complete probability space (O, J 7 , P). (All stochastic processes in the sequel will 
be defined on this probability space.) Each row of this matrix is a basis of an ap- 
proximation of the Wiener process with a dyadic step size At = 2~ 2m in time and 
a corresponding step size Ax = 2~ m in space. Thus we start with a sequence 
of independent simple, symmetric RW's S m (0) — 0, S m (n) = Y)f.—-\ X m (k) 
(n>l). 

The second step of the construction is twisting. From the independent RW's 
we want to create dependent ones so that after shrinking temporal and spatial 
step sizes, each consecutive RW becomes a refinement of the previous one. Since 
the spatial unit will be halved at each consecutive row, we define stopping times 
by T m (0) = 0, and for k > 0, 

T m {k + 1) = ia\n{n : n > T m (k), \S m (n) - S m (T m (k))\ =2} (to > 1) 

These are the random time instants when a RW visits even integers, different 
from the previous one. After shrinking the spatial unit by half, a suitable modi- 
fication of this RW will visit the same integers in the same order as the previous 
RW. We operate here on each point w G 57 of the sample space separately, i.e. 
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we fix a sample path of each RW. We define twisted RW's S m recursively for 
k = 1,2, ... using S m -i, starting with So(n) = So(n) (n > 0) and S m (0) = 
for any m > 0. With each fixed m we proceed for k = 0,1,2,... successively, 
and for every n in the corresponding bridge, T m (k) < n < T m (k + 1). Any 
bridge is flipped if its sign differs from the desired: 



X m (n) 



X m {n) if S m (T m (k + 1)) - S m (T m (k)) = 2X m ^{k + 1), 
-X m (n) otherwise, 



and then S m (n) = S m (n — 1) + X m (n). Then S m (n) (n > 0) is still a simple 
symmetric RW [101 Lemma 1]. The twisted RWs have the desired refinement 
property: 

(T m+1 (k)) = 2S m (k) (m>0,k> 0). 

The third step of the RW construction is shrinking. The sample paths of 
S m {n) {n > 0) can be extended to continuous functions by linear interpolation, 
this way one gets S m (t) (t > 0) for real t. The mth "twist and shrink" RW is 
defined by 

B m (t) = 2- m S m {t2 2m ). 
Then the refinement property takes the form 

B m+1 (r m+1 {k)2-^ m+1 ^ = B m (k2- 2m ) (m>0 7 k>0). (1) 

Note that a refinement takes the same dyadic values in the same order as the 
previous shrunken walk, but there is a time lag in general: 

T m+ i(fc)2- 2 ( m+1 > - k2- 2m ^ 0. (2) 

It is clear that this construction is especially useful for local times, since a 
refinement approximates the local time of the previous walk, with a geometri- 
cally distributed random number of visits with half-length steps (cf. Theorem 
© below). 

Now we quote some important facts from [TU] and [T^ about the above RW 
construction that will be used in the sequel. 

Theorem A. On bounded intervals the sequence (B m ) almost surely uniformly 
converges as m — > oo and the limit process is Brownian motion W . For any 
C > 1, and for any K > and rn > 1 such that K2 2m > N(C), we have 

sup \W{t) - B m {t)\ > 27 CK} (log* K)imh- 

0<t<K 

where := K V 1 and log* K := (log If) V 1. 

(N(C) here and in the sequel denotes a large enough integer depending on 
C, whose value can be different at each occasion.) 

Conversely, with a given Wiener process W, one can define the stopping 
times which yield the Skorohod embedded RW's B m (k2~ 2m ) into W. For every 
m > let s m (0) = and 

s m (k+l) = mf{ S :s>s m (k),\W(s)-W(s m {k))\ = 2- m } [k > 0). (3) 
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With these stopping times the embedded dyadic walks by definition are 

B m {k2~ 2m ) = W(s m (k)) (m > 0, k > 0). (4) 

This definition of B m can be extended to any real t > by pathwise linear 
interpolation. 

If a Wiener process is built by the "twist and shrink" construction described 
above using a sequence (B m ) of nested RWs and then one constructs the Sko- 
rohod embedded RWs (B m ), it is natural to ask about their relationship. The 
next lemma explains that they are asymptotically identical. In general, roughly 
saying, (B m ) is more useful when someone wants to generate stochastic processes 
from scratch, while (B m ) is more advantageous when someone needs discrete 
approximations of given processes. 

Lemma A. For any C > 1, and for any K > and m > 1 such that K2 2m > 
N(C) take the following subset oftt: 

A* Km = j sup sup |2- 2 "T mi „(fc) - k2- 2m \ < (42 CKlog, K)^mH' 

{n>m k 

where T m „(fc) = T„oT„_io- • -oT m (k) for n > m > and k € [0,K2 2m ]. Then 

P{(AK, m ) c }< x _\ Y _ c {K2 2m f~ c . 

Moreover, \\m n ^ oa 2~ 2n T m>n {k) — t rn (k) exists almost surely and on A* K m we 
have 

B m (k2- 2m ) = W{t m {k)) (0 < k2- 2m < K), 
c f- ©■ further, almost everywh 

sup \s m {k) - k2- 2m \ < (42CKlog r K)?mh- m . 

Q<k2- 2m <K 

The next theorem shows that the rate of convergence of (B m ) to W is es- 
sentially the same as the one of (J5 m ), cf. Theorem lAl 

Theorem B. For any C > 1, and for any K > and m > 1 such that 
K2 2m > N(C) we have 

sup \W{t)-B m {t)\ > 27CK?(]og„K)imh~ 

0<t<K 

In the last part of this section we quote a result from [12] about an elementary 
definition of Brownian local time, based on the "twist and shrink" RWs (B m (t)). 
This is basically a finite-time-horizon version of a theorem of Revesz [7], in a 
somewhat sharper form. 

First, one can define one-sided, up and down local times ^m(kj %) an d ^m{k, x) 
of the "twisted" RWs (S m (j)) at a point x e Z up to time k € Z + as ^„(0, x) — 
and 

lt{k,x) = #{j : < j < k, S m (j) = x, S m (j + 1) = x± 1} (k > 1). (5) 
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Then the two-sided local time is l m (k, x) — ^(k, x) + ^(k, x). 

The one-sided (and two-sided) local times of the mth "twist and shrink" 
walk B m at a point x E 2~ m Z up to time t E 2~ 2m Z + are defined as 



Ct l (t,x) = 2- m it l (t2 2m , x2 m ) 



(6) 



corresponding to the fact that the spatial step size of B m is 2- m . Then x) 
can be extended to any t E R+ and x E R by linear interpolation, as a continuous 
two-parameter process. 

Theorem C. On any strip [0,K] x R f/ie sequence a;)) almost surely 

uniformly converges as m — > oo £o t/ie one /ia// 0/ £/ie Brownian local time 
£(t,x). For any C > 1, and for any K > and m > 1 swc/i i/iai -fT2 2m > iV(C), 
we /iawe 



sup 

(t,K)G[0,if]xI 



> 50 Cif, 4 (log, K)im*2-^ 
30 



< 



l_ 4 i-c 



(K2 Zm ) 



2m\l-C 



Similar statements hold for (C m (t,x)) as well. 

Very similar statements hold when the "twist and shrink" walks B m (t) are 
replaced by Skorohod embedded walks B m {t) in the definition © and © of 
local time. This local time will be denoted by C^(t, x) in the sequel. 

Corollary 1. On any strip [0,K] x R the sequence (£+(<;, x)) almost surely 
uniformly converges as m —> 00 to the one half of the Brownian local time 
C(t, x). For any C > I, and for any K > and m > 1 such that K2 2m > N(C), 
we have 



sup 

(t,x)e[0,K]xI 



2 - C-m+i&x) 



> 50 CKi (log, KYm>2-^ 
32 



< 



(K2^ m ) 



2m\l-C 



Similar statements hold for (C m (t,x)) as well. 

Proof. By Lemma El almost everywhere on an event A* K whose complement 
has probability going to zero exponentially fast with m, one has that 

B m (k2- 2m ) = W{t m (k)) = W(s m (k)) = B m (k2- 2m ) (0 < k2~ 2m < K). 

Thus on A* Km , for almost every uj and for all {t,x) E [0, K ] x R, £+(i, 2;) = 

Hence, by the triangle inequality 



sup \C + (t,x)~ £+ (t,s)| < sup |£+ (t, x) - £+ (t, a;) | + sup |£ + (t, x) - £+ (t,x)\, 

(t,x) (t.x) (t,x) 
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and by Theorem we obtain the above statement: 



sup 

(t,x)e[0,K]xl 



2 - £m+l(M) 



> 50 CKi (log, K)Zm*2- ! ? 



< 



sup 

(t,x)e[o,K]x 



32 



2 

2m\l-C 



C(t,x)-jC+(t,x) 



> 50 CfsT» 4 (log, iC)%'2" 



□ 



3 A discrete Ito— Tanaka formula and its conse- 
quences 

It is interesting that one can give discrete versions of Ito's formula and of Ito- 
Tanaka formula, which are purely algebraic identities, not assigning any prob- 
abilities to the terms. Despite this, the usual Ito's and Ito-Tanaka formulae 
follow fairly easily in a proper probability setting. 

Discrete Ito formulas are not new. Apparently, the first such formula was 
given by Kudzma in 1982 [BJ. A similar approach was recently used by Fujita 
[3] and Akahori [T]. Discrete Tanaka formulae were given by Kudzma [BJ and 
Csorgo-Revesz [3] . The elementary algebraic approach used in the present paper 
is different from these; it was introduced by the first author in 1989 [9]. 

First consider an arbitrary numerical sequence X r — ±1 (r > 1) and partial 
sums So = a G M, S n = a + {X\ + • • • + X n )Ax (n > 1), where Air > is the 
step-size. Let / : M — > M, and b = a + kAx, k G Z. We define trapezoidal sums 
of / with step Ax on the interval [a, b] by 

fl |fcM 1 1 

T^J(x)Ax := sgn(6 - a)Ax I -f(a) + £ f(a + sgn(& - a)jAx) + -f(b) \ 

where sgn(a;) = 1,0, —1 according as x > 0, x — 0, or x < 0, respectively. 

Similarly as above, one-sided, up and down local times C + (n, x) and C~ (n, x) 
(n > 0) of the series (S n ) can be defined with step Aa;: £ ± (0, x) := and 

£ ± (n, x) := Ax f/={j : < j < n, Sj = x. Sj+i = x ± Ax}, 

where n > 1 and x = a + kAx, fc G Z. The two-sided local time is C(n,x) := 
C + (n, x) + C~ (n, x). 



Lemma 1. Take any function / : K 
sequence X r = ±1 (r > lj. Lei So = 
Then the following equalities hold: 



-» 1 , a £ 1, step Ax > 0, and numerical 
a, S n = a + (X 1 H h X n )Ax (n>l). 



T^ s J(x)Ax = £ JW 7^- 1J X r A, (7) 
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(discrete Stratonovich formula) . Alternatively, 

T^ s J(x)Ax = f; f(S r ^)X r Ax + \ J2 ^jf -11 ^)' (8) 

r— 1 r— 1 

(discrete ltd 's formula). Finally, 

n 

T^ s J(x)Ax = J2 f(S r -i)X r Ax (9) 

r=l 

+ 5 E {^ + K^)(/(^ + Ax)-/(x))+£-(n, a; )(/(x)-/(x-Ax))} 

(discrete Ito-Tanaka formula). 
Proof. Algebraically, 

T x %J(x)^ - TtzmAx = X r Ax f -^+im 

= HS^XrAx+ y^- f {Sr - l] (Axf 
2 X r /\x 

= f(S r ^)X r Ax 

+ 2 AX E (f( X + AX ) ^ /( 2; )) 1 {S r -i=a ; ,S r =x+Ax} 

+ ^ A:E E (/W " /(^ ^ Al )) 1 {S r _i=x,S r = r -Ai}- 

xGa+ZAa: 

The first equality follows from the fact that if X r — 1, one has to add a term, 
while if X r = —1, one has to subtract a term in the trapezoidal sum. Then 
the second equality follows since 1/X r = X r , and the third equality is obvi- 
ously implied by the second. Summing up for r = l,...,n, the sum on the 
left telescopes, and from the three equalities one obtains the three formulae, 
respectively. □ 

Introducing the notation h±& x {x) :— (f(x ± Ax) — f (x)) / (±Ax) and com- 
paring ([B]) and (J!]), we obtain a discrete occupation time formula, cf. (TIT 



E hx r Ax{S r -i){Ax) 2 = ^2 {hAx{x)C + (n,x) + h-& x (x)C (n,x)}Ax. 

r=l i£o+ZAi 

Let us apply Lemma[T]with Skorohod embedded walks (B m (t)). Elementary 
calculus shows, cf. Theorem 6], that when g € C 2 (K) and we set / = 
g' in ([7]) or in ([8|), then, as m — > oo, the terms converge almost surely to 
the corresponding terms of the Stratonovich and Ito's formula, respectively. 
On one hand, this gives a definition of the Ito integral J f(W(s)) dW(s) and 
Stratonovich integral f Q f(W(s)) dW(s), with almost sure convergence of Ito 
sums 

L*2 2m j 

(f(W)-W)?:= f(B m ((r~l)2- 2m ))2- m X m (r), (10) 

r=l 
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and Stratonovich sums 



(f(W)oW)?:= £ f(B m ((r- 1)2^)) + 1^2-^)) 

r=l 

respectively. Here (X m (r))^L 1 is an independent, ±1 symmetric coin tossing 
sequence, 2- m X m (r) = B m (r2- 2m ) - B m {(r - l)2- 2m ). This essentially means 
that in the sums approximating the Ito integral we apply a partition with 
the Skorohod stopping times = s m (0) < s m (l) < ■•• < s m ([t2 2m \), since 
B m (r2~ 2m ) = W(s m (r)). Second, taking almost sure limits as m — > oo, one 
immediately obtains the corresponding Ito's formula 

g(W(t)) - g(W(0)) = / g'(W(s)) dW(s) + \ [ g"(W(s)) ds, (11) 
Jo L Jo 

and Stratonovich formula 

g(W(t))-g(W(0))= f g'(W(s))dW(s). 
Jo 

In the same way, now we show almost sure convergence of stochastic sums 
(f(W) ■ W)™, when g is the difference of two convex functions and / = g'_, its 
left derivative. Then we immediately obtain the Ito-Tanaka formula as well, 
with the help of ©. 

Theorem 1. Let g be the difference of two convex functions, g'_ be its left 
derivative, and let be the signed measure defined by the change of g'_ when 
restricted to compacts (the second derivative of g in the generalized function 
sense). Then for arbitrary K > 0, 



(g'_(W)-W)™- / g'_(W(s))dW(s) 
Jo 

almost surely as m — > oo, and for any t > 0, 



(12) 



sup 

t€[0,if] 



g(W(t))-g(W(0))= g'_(W(s))dW(s) + - C(t,x)n(dx). (13) 

Jo L J -oo 

Proof. The basic idea of the proof is that one may substitute Skorohod em- 
bedded walks B m (r2- 2m ) = W{s m {r)), B m (0) = W{0) = a e R into © to 
obtain 

(✓-0*0 -WO? = T x %^ 2m )*- 2m )g'_{x)2-™ 

\ J2 {£Ut,x)(9-(x + 2- m )-9'-(x)) 



2 



+C m {t,x){g'_(x)-g'_{x-2- m ))}- (14) 



Then it is enough to deal with the almost sure uniform convergence on compacts 
of the terms on the right side of (fT4|) , and that will imply the same convergence 
of the stochastic sum on the left side and will prove the Ito-Tanaka formula 
([TBI as well. 
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First we show (fTU)) for measures /x supported in a compact interval [— M, M]. 
Because of the linearity of (fT5|) . it can be supposed that g is convex. Then g'_ 
is non-decreasing, left-continuous, and g'_(x) is constant if x > M or x < — M, 
hence g'_ is bounded on K. The measure /i, defined by /z([a, b)) := g'_{b) — g'_{a) 
if a < b, is a regular, finite, positive Borel measure on M, with total mass 
fx(R)=g> + (M)-g'_(-M). 

We are going to prove (fT5)) pathwise. For this, let Qo ; P {Hq} — 1, denote a 
subset of the sample space Q, on which, as m — > oo, B m (t) uniformly converges 
to W(t) on [0, K] and x) uniformly converges to £ (i, cc) on [0, X] x R, cf. 

Theorem [Bl and Corollary [T] above. We fix an uj G Slo- Then, obviously, W(t) 
and ^^(t, a;) will have continuous paths. 

Consider the first term on the right side of Q3]). We want to show that it 
uniformly converges to g(W(t)) — g(W(0)) for t G [0,-ff]. With the notation 
t m := [t2 2m \2- 2m , 



sup 

te[o,K] 



W(t) 

T*^g'_{x)2-™ - I g'_(x)dx 

Bm(tm) 



< sup 
te[o,K] 



sup 

te[o,K] 



TWg'_(x)T 

W{t) 



g'_{x) dx 



B m (i„ 



g'_{x) dx 



(15) 



The first term on the right side of (|15|) goes to zero as m 



be- 



cause is non-decreasing, hence Riemann integrable on any compact inter- 
val [a, B m (t m )], the trapezoidal sum is a Riemann sum on the same interval, 
so their difference can be estimated from above by the difference of the upper 
sum and the lower sum of g'_ on [a, B m (t m )], which, in turn is dominated by 
2- m (g' + (M) - gL(-M)) for any t e [0,K]. 

The second term on the right side of (fT5"]) converges to as well, since B m (t m ) 
converges uniformly to W(t) for t G [0, K], and g'_ is bounded on R. 

Since the slope of g at any point is bounded above by g'AM) and below by 
g'_(—M), we see that g is Lipschitz, so absolutely continuous. Hence 



W(t) 
'IV(O) 

Thus we have proved that 



g'_(x)dx = g(W(t)) - g(W(0)). 



sup 

t£[0,K] 



g(W(t))-g(W(0))-T^ 



B m (L*2 2m J2" 2m ) / 



g'_{x)2- m ^0 (16) 



almost surely as m — » oo. 

Now we turn to the convergence of the second term on the right side of (fT4 
Fixing an u G ilo, we want to show that as m — > oo, 



sup 

te[o,K] 



xea+2 



/oo -, 



dx) 





(17) 
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(The other half containing the C m (t, x) terms is similar.) Now, we have that 

/OO i 
-£(t,x)^dx) 



sup 

te[o,if 



iga+2- 



< 



sup 

te[o,K] 



E 



xea+2- 



C+(t,x)--£(t,x) 



(g'_(x + 2- m )-g'Jx)) 



(18) 



+ o sup 

z t£[0,K] 



x£a+2 



/oo 
-oo 



The first term on the right side of (fig)) goes to zero, since, by Corollary [TJ 
£+(<, a;) uniformly converges to ^£(t,x) on [0, K] x R, and 

(g'-(x + 2" m ) - gL(x)) < g' + {M) - gl(-M) < oo. 

The second term on the right side of (fT5|) also goes to zero, since it is the 
difference of a Riemann-Stieltjes integral and its approximating sum. Recall 
that C(t, x) is continuous, non-decreasing in t for any x, C{K, x) has compact 
support, so bounded, as x varies over R, and the total mass /x(R) < oo. 

Thus by (JTSJ) and (fTT|) . we have established the statements of the theorem 
for measures \i supported in a compact interval [-M, M\. 

Consider now the case of a general g. Let T AI :— inf {t > : \W(t)\ > M} 
for M — 1,2,..., and approximate g by 



ff(M)+ff+(M)(a;-M) if x > Af 
(x) = <( 3 (x) if |x| < M 

g(-M) + g'_(-M)(x - M) if x < —M. 



Define /i M as the measure determined by the change of (g M )'_, which is sup- 
ported in [— M, M] with finite mass. Clearly, fl>(W(t)) = g M (W(t)) if < 
t < T M and, since C(T M ,x) = for all x, \x\ > M, J C(t,x)fi(dx) = 
J C{t,x)^i M (dx) if 0<t <T M . 

The previous argument proves (fT2|) for the interval [0, K A T M ], and also 
(TT3l) if < t < T M . Since the stopping times (T m )m=i increase to oo a.s., this 
proves the theorem. 

□ 

Comparing Ito's formula (jTTJ) and Ito-Tanaka formula (TT5|) when g is C 2 and 
convex, we obtain that 



(W(s))ds= £(t,x)g"(x)dx. 



Since this holds for any continuous and positive function g" , by a monotone class 
argument we obtain the well-known occupation time formula for any bounded, 
Borel measurable function h: 



h(W(s))ds 



h(x)C(t, x) dx. 



(19) 
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As a special case, one gets C(t, x) dx = t. Also, from (fTS)) with g(x) 
■ — a\, one obtains Tanaka's formula 

C(t, a) = \W(t) -a\- \W(0) - a\ - [ sga(W(s) - a) dW(s). 

Jo 

4 Integration of predictable processes 

In this section our aim is to show that when the integrand (Y(t))t>o is a pre- 
dictable process satisfying some integrability condition, our previous approach 
to approximate the stochastic integral by sums of values of the integrand at 
the Skorohod stopping times s m (k), multiplied by ±2 _m , where the signs are 
obtained by a fair coin tossing sequence, still works. In other words, the stan- 
dard non-random partitions of a time interval [0, t] may be replaced by our 
random partitions = s m (0) < s m (l) < • • • < s m ([t2 2m \) of Skorohod embed- 
ding of random walks into the Wiener process. This is not surprising, because, 
as Lemma lAl shows, such Skorohod partitions are asymptotically equivalent to 
partitions (k2~ 2m ) t ^ =0 , as m — > oo. This approach will be described rather 
briefly, since in this case the details are essentially the same as in the standard 
approach to stochastic integration found in textbooks. 

Let (J-"t)t>o denote the natural filtration defined by our BM W. In this 
paper, stopping times and adapted processes are understood with respect to 
this filtration. By definition, we say that Y is a simple, adapted process if there 
exist stopping times < tq < t\ < ■ ■ ■ increasing to oo a.s. and random 
variables £o>£i)-- • such that £j is T Tj -measurable, E(£?) < oo for all j, and 

oo 

Y(t) = £ l {0 }(i) + X>-il(r,._ 1 ,T f .](t) (* > 0). 

r=l 

In the sequel, the only case we will consider is when, with a given m > 0, 
the stopping time sequence is the one given by the Skorohod embedding 
Let Y be a left-continuous, adapted process and with a fixed b > 0, take the 
truncated process 

f b if Y(t) > b 
Y b (t):={ Y{t) H\Y(t)\<b (20) 
[ -6 if Y(i)< -6. 

Further, with m > fixed, take 

oo 

Y r b n (t) :=y fc (0)l {o} W+^r b ( Sm (r-l))l (Sm(r _ 1) , SmW] (t) (i>0), (21) 

r=l 

with the Skorohod stopping times ■ Then Y^ is a simple, adapted process. 
Then, similarly as in (fTU|). a stochastic sum of Yj^ is defined as 

Lt2 2m j 

K'W) t := £ Y\s m (r-l))(W(s m (r))-W(s m (r-l))) 
r=l 

L*2 2m j 

= J2 Y»(s m (r-l))X m (r)2~ m , (22) 
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where (X m (r))^_ 1 is a sequence of independent, F {X m (r) = ±1} = i random 
variables. Observe that (|2"2"|) without the X m (r) = ±1 factors would be asymp- 
totically an ordinary Riemann sum of the integral J* Y h (s) ds. (The differences 
s ra (r) — s m (r — 1) asymptotically equal 2~ m by Lemma 1X1) The random ±1 
factors multiplying the terms of the sum transform it into an approximation of 
a stochastic integral. 

It is clear that the usual properties of stochastic integrals hold for such 
stochastic sums. Namely, one has linearity: (aY^ +(3Z^ n ) ■ W = aY^ -W + fiZ^ ■ 
W, zero expectation: E(Y%-W) = 0, isometry: E(Y^-W) 2 K = /* E(Y^(t)) 2 dt, 
etc. 

Lemma 2. Let K > be fixed and Y be a left- continuous, adapted process such 
that 

\\Yf K := / EY 2 (t)dt < oo. 
Jo 

Then there exists a sequence (m(b))'^L 1 such that \\Y — Y~mCb) || — *" as b — > oo, 
where Y b n is defined by V21)) . 
Proof. First, for any b, 

\\y-Y b \\ 2 K < [ K E(Y 2 (t)l {lY{t)l > b] )dt 
Jo 

< / EY 2 (t)dt<oo. 
Jo 

Therefore \\Y — Y b \\x — » as b — > oo by dominated convergence. 

Further, for b fixed and m — > oo, Y^(t) — > Y b (t) a.s. for all t G [0, K], hence 
ll^ b — ^jnll-K — » by bounded convergence. These prove the lemma. 

□ 

So let Y be a left-continuous, adapted process such that ||y||jr < oo and take 

Jb(t) ■= (^m(f,) " WJ ,t£ [0,K], where Y b ^ is defined by the previous lemma. 

Since Y^,^ tends to Y in L 2 ([0, K] x Vt), by isometry, Jb(t) tends to a random 

variable J(t) in L 2 (Q), which is defined as the stochastic integral L Y(s) dW(s) 
fort G [O.A"]. 

From this point, the discussion of the usual properties of stochastic integrals 
and extensions to more general integrands largely agrees with the standard 
textbook case, therefore omitted. 

5 Extension to continuous local martingales 

The extension of the methods of the previous sections to continuous local mar- 
tingales is rather straightforward. By a useful theorem of Dambis and Dubins- 
Schwarz (DDS), a continuous local martingale M can be transformed into 
a Brownian motion W by a change of time, when time is measured by the 
quadratic variation [M] t . 

Let a right-continuous and complete filtration (Ft)t>o be given in (O, J 7 , P) 
and let the continuous local martingale M be adapted to it. Define 

T s = mi{t > : [M] t > s} (s > 0). 
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Theorem D. If M is a continuous {Tt)-local martingale vanishing at and 
such that [M]oo = oo a.s., then W(s) — M{T S ) is an (J r T a )-Brownian motion 
and M(t) = W([M] t ). 

Similar statement is true when [M]^ < oo is possible, cf. [5J p. 182]. For 
example, M(t) = W([M] t ) still holds for any t > 0. 

Now some results are recalled from [11] . Skorohod-type stopping times can 
be defined for M, similarly as for W in Q. For m > 0, let r m (0) = and 

r m (k + 1) = inf {t : t > r m (k), \M(t) - M(r m (fc))| = 2-™} (k > 0). 

Then s m (k) = [M] Tm(k) and B m (k2- 2m ) = W(s m (k)) = M(r m (fc)), where 
s m (k) is defined by ^ for the DDS BM W and B m is a scaled simple, symmetric 
RW, as in previous sections. 

Let N m (t) denote the following discrete quadratic variation process of M: 

N m (t) = 2- 2m #{r:r>0,T m (r)<t} 

= 2- 2m #{r:r>Q,s m (r)<[M} t } {t > 0). 

The next theorem is a reformulation of [111 Theorems 2 and 4] , correcting some 
unfortunate errors there. 

Theorem E. Let K > and take a sequence (c m ) increasing to oo arbitrary 
slowly. Then for a.e. to 6 Q, if m > mo(w), we have 

sup \[M] t - N m {t)\ < c rn mh- m 

0<t<K 

and 

sup \M(t) - B m (N rn (t))\ < c m mh-^. 

0<t<K 

Thus N m (t) almost surely uniformly converges on compacts to [M] t , and 
B m {N m (t)) similarly converges to M(t). 

Combining the DDS time change and the pathwise approximations of local 
time described in Section O it is not too hard to generalize the local time results 
of [T2] to continuous local martingales M. Let the mth approximate local time 
of Mat a point x £ 2~' n Z up to time t G 2~ 2m Z+ he defined by 

x) = 2" m #{s : B m (s) = x, B m (.s + 2~ 2m ) =x± 2~ m }, 

where s G 2- 2m Zn[0, [M] t ) and B m (j2- 2m ) = M(r m (i)). Then x) can 

be extended to any t G K+ and x G M by linear interpolation, as a continuous 
two-parameter process. 

The major difference compared to Corollary [T] is that that the time interval 



[0, K] has to be replaced by an interval of the form [0, T^] here, where Tl 



U, J- b j I1C1C, W11C1C -L 

Tb A K, over which [M] t is bounded by a constant b. 

Corollary 2. On any strip [0, T b K ] xl, the approximate local times (C^' + (t, x)) 
a.s. uniformly converge as m — > oo to the one half of the local time C (t, x) — 
C w ([M] t , x), where C w denotes the local time of the DDS BM of M. For any 
C > 1, and for any K > 0, b > e, and m > 1 such that b2 2m > N(C), we have 



sup 

(t,x)e[o,T, ir ]xi 



> 50 C63(log&)3m32 
32 



< 



2ra\l-C 
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Similar statements hold for (£ m (t,x)) as well. 



From this, by the Borel-Cantelli lemma, we get that for a.e. oj 6 O, if 

Tn > niQ^uj), we have 



sup 

(t,x)e[o,T^]xi 



~£ M (t,x)-£™£(t,x) 



sup 



1 ~£ M (tAT b K ,x)-£™£(tAT b K ,x) 



< 50 Cbi (log 6)3 mi 2~ ^ . 

If we replace 6 here by a sequence (c m ) increasing to oo, then, by the conti- 
nuity of [M] t , T Cm / oo, and so t A Tf m / t for t e [0, Kj. Hence we have 

Corollary 3. Let K > and take a sequence (c m ) increasing to oo arbitrary 
slowly. Then for a.e. lo £ £l, if m > m^ia), we have 



sup 

(t,x)e[o,K]xi 



1 



M,±i 



Next define ltd integrals w.r.t. M when the integrand is a 'nice' function of 
M. An ltd sum is defined by 

(f(M).M)l n := J2 f(M(r m (r-l)))2- m X m (r) 

T m {r)<t 

= f(W(s m (r-l)))2- m X m (r), (23) 

s m (r)<[M] t 

where 2- m X m (r) = M(r m (r)) - M(r m (r - 1)) = W{s m (r)) - W(s m (r - 1)), 
(X m (r))^Li being an independent, ±1 symmetric coin tossing sequence. 

Theorem 2. Let g be a C 2 -function, alternatively, be the difference of two 
convex functions. Then for arbitrary K > 0, 



sup 

te[o,K] 



(g'_{M).M)\ 



g'_(M(s))dM(s) 







almost surely osm-t oo, and in the first case one gets the ltd formula 



j(M(t)) - g(M(0)) = / g'(M(s))dM(s) 



1 



in the second case one obtains the It6~Tanaka formula 



j(M(t))-g(M(0))= / g'_(M(s))dM(s) 



1 



g"(M(s))d[M] s ; 



£ M (t, x)n( dx). 



The proof is similar to the Brownian case, therefore omitted. 
(|23|) indicates that if the function / is C 1 or of finite variation on bounded 
intervals, then 



f(M(s))dM(s) = 



[M]t 



f(W(s))dW(s). 
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We mention that the DDS BM W and the quadratic variation [M] are in- 
dependent processes if and only if M is symmetrically evolving (is an Ocone 
martingale) in the sense that given the past of M, distributions of future incre- 
ments are invariant under reflection, cf. [TTJ Theorem 5]. 

Finally, we can extend the method of integration of predictable integrands 
discussed in Section 0] to integrals w.r.t. a continuous local martingale M. Let 
K > be fixed and F be a left-continuous, adapted process on [0, K] such that 

\\Y\\ 2 M :=E ( K Y 2 (t)d[M] t <oo. 
Jo 

For b > let F denote the truncated process (|20|) . If m > fixed, take 

oo 

^^^l^jW + ^^^lr-l))!^^!),^^)]^), (24) 

r=l 

for f > 0. Then F^ is a simple, adapted process. 
An Ito sum of F^ is defined as 

(Y^-M) t := Yl Yb ^ r 

T m (r)<i 

= £ F b (r m (r 

T m (V)<t 

where (X m (r))^ 1 is a sequence of independent, P{X m (r) = ±1} = | random 
variables. 

Here too, cf. Lemma [51 there exists a sequence (m(b))^ 1 such that ||Y — 
F^Jm,k — > as 6 — > oo, where is defined by ([21]). So take J b (t) := 

• M) , t e [0,if]. Since F^ (6) tends to F in L 2 M ([0, K]xQ), by isometry, 

Jb(t) tends to a random variable Jit) in L 2 (J7), which is defined as the stochastic 
integral J Q * Y(s) dM(s) for t e [0, if]. 
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